Contributor: 杜成玉
Download: https://catalog.ldc.upenn.edu/LDC2002S09
Overview
Source: https://www.zhihu.com/question/63383992/answer/222718972
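This dataset consists of transcripts of the 40 English telephone conversations used in the 2000 HUB5 evaluation sponsored by NIST (National Institute of Standards and Technology). It is an English-only speech dataset, and it was used in Baidu's paper "Deep Speech: Scaling up end-to-end speech recognition". Recommended application: automatic recognition of conversational telephone speech.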
Related papers
[1] Hain T, Woodland P C, Evermann G, et al. New features in the CU-HTK system for transcription of conversational telephone speech[C]//Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP’01). 2001 IEEE International Conference on. IEEE, 2001, 1: 57-60.
[2] Seide F, Li G, Chen X, et al. Feature engineering in context-dependent deep neural networks for conversational speech transcription[C]//Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on. IEEE, 2011: 24-29.
[3] Sundaram R, Ganapathiraju A, Hamaker J, et al. ISIP 2000 conversational speech evaluation system[C]//Speech Transcription Workshop, College Park, Maryland, USA. 2000.
[4] Woodland P C, Povey D. Large scale MMIE training for conversational telephone speech recognition[C]//Proc. Speech Transcription Workshop. 2000, 2(2).